Qi Wang

Lattice

Policy and World Modeling Co-Training for Language Agents

Jun 01, 2026

RLVR without Ineffective Samples: Group Prioritized Off-Policy Optimization for LLM Reasoning

May 31, 2026

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

May 26, 2026

StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration

May 25, 2026

OctCGS: Octree-Contextual Gaussian Splatting with Explicit Multi-Order Propagation Modeling for Channel Knowledge Map Construction

May 21, 2026

STAR-IOD: Scale-decoupled Topology Alignment with Pseudo-label Refinement for Remote Sensing Incremental Object Detection

May 20, 2026

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

May 07, 2026

The Fourth Challenge on Image Super-Resolution ($\times$4) at NTIRE 2026: Benchmark Results and Method Overview

Apr 16, 2026

Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers

Apr 14, 2026

Batch Loss Score for Dynamic Data Pruning

Apr 06, 2026